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Abstract 

This paper investigates the problem of secure lossy source coding in the presence of an eavesdropper 
with arbitrary correlated side informations at the legitimate decoder (referred to as Bob) and the eavesdropper 
(referred to as Eve). This scenario consists of an encoder that wishes to compress a source to satisfy the 
desired requirements on: (i) the distortion level at Bob and (ii) the equivocation rate at Eve. It is assumed that 
the decoders have access to correlated sources as side information. For instance, this problem can be seen 
as a generalization of the well-known Wyner-Ziv problem taking into account the security requirements. 
A complete characterization of the rate-distortion-equivocation region for the case of arbitrary correlated 
side informations at the decoders is derived. Several special cases of interest and an application example 
to secure lossy source coding of binary sources in the presence of binary and ternary side informations are 
also considered. It is shown that the statistical differences between the side information at the decoders and 
the presence of non-zero distortion at the legitimate decoder can be useful in terms of secrecy. Applications 
of these results arise in a variety of distributed sensor network scenarios. 

I. Introduction 

Consider the problem of compressing correlated sources at sensor nodes in a distributed fashion where 
the sensors may wish to communicate with each other on a wireless network. Assume also that each of 
these sensors can have access to a correlated observation to the source or random field of interest. This 
observation can be used as side information available at the decoder to minimize the distortion between 
the original source and the estimate at the legitimate decoder (referred to as Bob). In addition to this, 
we assume that each of the encoders (referred to as Alice) wishes to leak the least possible amount of 
information about its source to an eavesdropper (referred to as Eve), e.g. an untrusted sensor, who may 
capture such information during the communication between nodes. 

The work of J. Villard is supported by DGA (French Armement Procurement Agency). This research is partially supported by the 
FP7 Network of Excellence in Wireless COMmunications NEWCOM++. 
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The above scenario involves many of the major information-theoretic issues on source and channel 
coding problems. In terms of source coding, Slepian-Wolf [1| and Wyner-Ziv [2| introduced the problem 
of source coding with side information at the decoder. This topic has been the focus of intense study 
and some remarkable progress has already been made in theoretical and practical aspects. On the other 
hand, extensive research has been done during the recent years on secure communications over noisy 
channels. The traditional focus was on cryptography, based on computational complexity where security 
only depends on the intractability assumption of NP-complete problems that must be solved prior to 
decoding. Another approach is the information-theoretic notion of secrecy, introduced by Shannon in (3), 
where security is measured through the equivocation rate (i.e. the remaining uncertainty about the message) 
at the eavesdropper. The wiretap channel was introduced by Wyner J4), who showed that it is possible 
to send information at a positive rate with perfect secrecy as long as the channel of the legitimate user 
is less noisy than the channel of the eavesdropper. Csiszar-Korner [5] extends this result to the setting 
of general broadcast channels with any arbitrary equivocation rate. Several extensions of the wiretap and 
fading channels have been done (cf. J6], Q and references therein). So far, very few work has been 
reported on source coding (or compression) problems with security constraints. 

One can identify two approaches in the literature on secure source coding. In fact, it is assumed either 
that there already exists a secure rate-limited channel between Alice and Bob, which allows the system 
to use secret keys, or the decoders have access to side information about the source. In the scenario of 
secret key sharing, both lossless and lossy compression have been studied in various contexts |8), 0, ifToll . 
1111 . H21, lfT3l . For the second scenario where side information is available at both decoders, the case of 
lossless source coding has been recently studied in fl4l . lfT31l . fl6l . ifTTl . Whereas the general lossy source 
coding problem has not been fully solved, some particular cases can be derived as part of previous work. 
It is important to mention here that if the side informations between Bob and Eve are degraded then the 
result follows as a special case of fi"8l . 

In this paper, we investigate the problem of secure lossy source coding of memoryless sources in the 
presence of an eavesdropper with different correlated side informations at the decoders of Bob and Eve, as it 
is shown in Fig.Q] In this setting the channels between encoder and decoders are assumed to be noiseless so 
that they cannot provide any advantage to increase security. Our goal is to understand the minimum amount 
of information that needs to be revealed to Eve to satisfy the distortion constraint at Bob. We provide a 
complete characterization of the rate-distortion-equivocation region for the case of arbitrary correlated side 
informations. Several special cases of interest are also considered. As an application example, we consider 
the case of secure lossy source coding of a binary source, where the side information at Bob (resp. Eve) 
is the output of a binary erasure channel (resp. a binary symmetric channel) with the source as the input. 
This model is of interest since neither Bob nor Eve can always be a lessnoisy decoder. 

The organization of this paper is as follows. Section II states definitions along with the main results, 
while Section III provides several special cases and discussion. The sketch of the proofs are relegated to 
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Section IV. Finally, Section V presents an application example to binary sources and Section VI summarizes 
the paper. 



For any sequence (xi)i^», notation stands for the collection (xk, icjt+i, • • • , x n ). x r { is simply 
denoted by x n . By extension, for any subset J C {1, ...,n}, notation xj stands for the collection 
(xj)j£j. The cardinality of an alphabet is denoted by ||-||. For every e > 0, we denote e-typical and 
conditional e-typical sets by T"(X) and T™(Y\x n ), respectively. Following fl9l . entropy is denoted by 
H(-) and mutual information by /(•; •). Let X, Y and Z be three random variables on some alphabets with 
probability distribution p. If p(x\y, z) — p(x\y) for each x, y, z, then they form a Markov chain, which is 
denoted by X -e- Y -e- Z. For each x 6 M, notation [x] + stands for max(0; x). For each a, b € [0, 1], 
a-kb = a(l - b) + (1 - a)b. 



A. Problem Definition 

In this section, we give a more rigorous formulation of the context depicted in Fig. Q] Let A, B and 
£ be three finite sets. Alice, Bob and Eve observe the sequences of random variables (j4j)j e N*> (-Bj)jgpj* 
and (E^igN* respectively, which take values on A, B and £ , resp. For each i e N*, the random variables 
Ai, Bi and -B, are distributed according to the joint distribution p(a, b, e) on A x B x £. Moreover, they 
are independent across time i. 

Let d : A X A —> [0 ; d max ] be a finite distortion measure i.e., such that < d max < oo. We also 
denote by d the component-wise mean distortion on A n x A n i.e., for each a n ,b n G A n , d(a n ,b n ) = 



Notations 



II. Problem Definition and Main Results 



Definition 1: An (n,i?)-code for source coding in this setup is defined by 
> An encoding function at Alice / : A n — > {1, . . . , 2 nR }, 
. A decoding function at Bob g : {1, . . . , 2™^} x B n -> A n . 



B 





A 



W 



* Alice 



(rate R) 



> Eve 



* ±H(A n \WE n ) > A 




E 



Figure 1 . Secure lossy source coding in the presence of side information at the decoders. 
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Definition 2: A tuple (R, D, A) e K.+ is said to be achievable if, for any e > 0, there exists an 
(n, R + e)-code (/, g) such that: 

E[d(A n ,g(f(A n ),B n ))} < D + e, 

-H(A n \f(A n ),E n ) > A-e. 
n 

The set of all achievable tuples is denoted by 1Z* and is referred to as the rate-distortion-equivocation 
region. 

B. Main Result 

The proof of the following theorem is given in Sections IIV-AI and IIV-BI 

Theorem 1: Region 1Z* is the set of all tuples (R, D, A) such that there exist random variables U, V 
on some finite sets U, V, respectively, and a function A : V x B — >• A such that U -e- V -e- A -e- (B, E) 
form a Markov chain and 

R > I(V;A\B) , 

D > E[d(A, A(V, B))] , 



A < 



H(A\VB) + I (A; B\U) - I (A; E\U) 



The first two inequalities in Theorem [T] are classical in rate-distortion theory. Let us give some intuition 
on the third one. The first term H(A\VB) corresponds to the equivocation rate at Bob. Alice thus exploits 
the available distortion at Bob to increase the equivocation rate at Eve. Moreover, for a given random 
variable V, which determines the rate R and the distortion level D, auxiliary variable U may be tuned 
to make Bob more capable than Eve i.e., maximize I (A; B\U) — I (A; E\U). This quantity represents the 
gain (or the loss) at Eve in terms of equivocation rate. 

The following proposition gives upper bounds on the cardinalities of alphabets U and V. The proof is 
omitted here and will be provided in an extended version of this paper. 

Proposition 1: In the single-letter characterization of the rate-distortion-equivocation region 1Z* given 
by TheoremQ] it suffices to consider sets U and V such that ||W|| < \\A\\+2 and ||V|| < (||.4||+2)(||.4|| + 1). 

III. Special Cases of Interest 
In this section, we derive optimal regions of some special cases of Theorem Q] 

A. Lossless Secure Source Coding 

The lossless secure source coding problem corresponds to a zero distortion level at Bob (D = 0). In 
this case, the following corollary, which can also be found in |14|, [15], directly follows from TheoremQ] 
(simply set V = A): 
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Corollary 1: A tuple (R, 0, A) is achievable i.f.f. there exists a random variable U on some finite set 
U, such that U -e- A -e- (£, £7) form a Markov chain and 

R > H(A\B) , 



< 



I{A-B\U) -I(A;E\U) 



B. Bob Has No Side Information 

Corollary 2: If Bob has no side information, then region 1Z* reduces to the set of all tuples (R, D, A) 
such that there exist random variables U, A on U and A, respectively, such that U -e- A -e- A -e- S form 
a Markov chain and 

i? > J(i;A) , 
D > E[d(A,A)} , 



A < 



H(A\A) - I(A;E\U) 



Note that, as opposed to the secure lossless source coding problem 1151 . in our situation, non-zero 
secrecy is still achievable when Bob has no side information. In fact, Alice can exploit the available 
distortion at Bob to increase the equivocation rate at Eve. 

C. Bob Has Less Noisy Side Information Than Eve 

Definition 3: The side information B is less noisy than the side information E if 

I(U;B)>I(U;E) , 

for each random variable U such that U -e- A -e- (B, E) form a Markov chain. 

Note that the less noisy condition is strictly weaker than the stochastically degraded one. 

Corollary 3: If Bob has less noisy side information than Eve, then region 1Z* reduces to the set of 
all tuples (R, D, A) such that there exist a random variable V on some finite set V, and a function 
A : V x B — > A such that V -e- A -e- (B, E) form a Markov chain and 

R > I(V;A\B) , 

D > E[d(A,A(V,B))] , 



A < 



H(A\VB) + I (A; B) — I (A; E) 



In this case, random variable U of Theorem Q] is set to a constant value and hence Wyner-Ziv coding (2) 
achieves the optimal performance. 
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D. Eve Has Less Noisy Side Information Than Bob 

Corollary 4: If Eve has less noisy side information than Bob, then region 1Z* reduces to the set of 
all tuples (R, D, A) such that there exist a random variable V on some finite set V, and a function 
A : V x B -4 A such that V -e- A -e- (B, E) form a Markov chain and 

R > I(V;A\B), 

D > E[d(A, A(V,B))} , 

A < H(A\VE) . 

In this case, random variable U of Theorem [T] is set to random variable V and hence Wyner-Ziv 
coding [2 1 achieves the optimal performance. Therefore it is not surprising that the equivocation rate at 
Eve corresponds to the case where Eve can reliably decode V. Here, Alice can only exploit the available 
distortion at Bob to achieve a non-zero equivocation rate at Eve. 

IV. Sketch of Proof of Theorem Q] 

A. Proof of Achievability 

In this section, we prove the achievability part of Theorem Q] i.e., we prove the following proposition: 
Proposition 2: Let U, V be random variables on some finite sets U, V, respectively, such that U -e- 
V -e- A -e- (B, E) form a Markov chain, and A : V x B -> A. If 

R > I(V;A\B) , 

D > E[d(A, A(V, B))} , 



A < 



H(A\VB) + I (A; B\U) - I (A; E\U) 



then (R, D, A) is achievable. 
Proof: Let e > and define 



A* = [H(A\VB) + I(A;B\U) -I(A;E\U)] 



5 • max{d max , A*} 

For a sufficiently large n, we build an (n, R + e)-code (/, g) which achieves the required distortion and 
equivocation rate levels. 

1) Codebook generation: Randomly pick 2 nSl sequences u n (si) from T™(U) and divide them into 2 nRl 
equal size bins {-Bi(Vi)} ri £{i ; ... i 2' iR i}- Then, for each codeword u n (si), randomly pick 2 nS2 sequences 
v n (si,S2) from T™(V\u n (si)) and divide them into 2 nR ' 2 equal size bins {^(si, r 2)}r 2 e{i,....2 na 2}- 

2) Encoding: Assume that sequence A n is produced at Alice. Look for a codeword u n (si) such that 
(u n ( Sl ),A n ) £ T £ " (U, A). Then look for a codeword v n (si , s 2 ) such that {v n (s 2 ),A n ) e T? (V, A\u n (si)). 
Let Si(ri) and B2{s\,r2) be the bins of u n (si) and v n (s±,S2), respectively. Alice sends the message 
f(A n ) = (ri,r2) on the error-free channel. 
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3) Decoding: Assume that Bob receives (ri,r 2 ) from Alice and his side information sequence B n . 
Now look for the unique codeword u n (si) G Bi(n) such that (u n (si),B") G T' l (U,B). Then look for 
the unique codeword v n (s 1} s 2 ) G B 2 (si,r 2 ) such that (v n (s 1} s 2 ), B n ) G T"(V, B|u n (si)). Compute 



each i = {1, . . . , n}. 

4) Errors and constraints: 

. For n large enough, Pr {(A™, B") £ T"(A, B)} < 5. 

• In the first encoding step, Alice needs to find (at least) one codeword u n {s{) such that {u n (s\), A n ) G 
T?(U,A). If Si > I(U; A), then the probability that this step fails can be upper bounded by 5 for 
a sufficiently large n. Similarly, the second encoding step requires the condition S 2 > I(V;A\U) to 
succeed with a probability higher than 1 — 5. 

• In the first decoding step, Bob looks for the unique codeword u n (si) G Bi(n) such that (u n (si), B n ) G 
T"(U, B). If S\— R\ < I(U; B), then the probability that there exists another admissible codeword 
can be lowered below S for a large n. Similarly, the second decoding step requires the condition 
S 2 — i?2 < I(V; B\U) to succeed with a probability higher than 1—5. 

Defining the global transmitted rate R + e = R\ + R 2 and putting all inequalities together, we prove 
that a sufficient condition for the above code to work with an error probability lower than 55 is given by: 



where step (a), resp. (6), follows from the Markov chain U -e- V -e- (A, B), resp. V -e- A -e- B. This 
condition is verified under the given assumption on R i.e., R > I(V; A\B). 

5) Distortion at Bob: Denote by F the event "An error occurred during the encoding or decoding 
steps." We now check that our code achieves the required distortion level at Bob: 



the estimate g(ri,r 2 , B n ) G A n using the component-wise relation gi(ri,r 2 , B n ) = A(vi(si, s 2 ), Bj) for 



R + e 



Ri + R2 



> I(U; A) - I(U; B) + I(V: A\U) - I(V; B\U) 
( =' I(V; A) — I(V; B) 
= I(V;A\B), 



E[d(A n ,g(f(A n ),B n ))] < Pr{F}E d(A n , g(r u r 2 , B n )) F +Pr{F} d, 



mar 




= E 



[d(A,A(V,B))] +e 



< D + e . 
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6) Equivocation Rate at Eve: The equivocation rate at Eve can be lower bounded as follows: 

-H(A n \f(A n ),E n ) > ^i^lH{A n \ ri r 2 E n ,F) 
n n 



> 



> 



l-5<5r 



n 

1 - 5<5 r 



H(A n ) - I{A n ;nE n ) - I{A n -r 2 \r 1 E n ) 



H{A n ) ~ I{A n ; U n E n ) - H{r 2 ) 



(1 - 58) H(A) - I(A; UE) - - H{r 2 ) 



> (1 - 5(5) H{A\UE) - R 2 



> 



H(A\UE) - R 2 



5S 



H{A\UE) - R 2 



If R 2 > I(V; A\U) — I(V; B\U), the equivocation rate can thus be bounded as follows: 
1 



H(A n \f(A n ),E n ) > H(A\UE)-R 



56 



H{A\UE) - I(V; A\U) + I{V\ B\U) 



> 



H{A\UE) - R 2 



where the last inequality follows after some straightforward derivation from the definition of 6 and the 
Markov chain U -e- V -e- A -e- B. 

Thus, each A < H(A\UE) — R 2 is achievable. The above constraint on R 2 together with the latter 
inequality yield the given sufficient condition. 



Note that our coding scheme can be interpreted as a simple binning operation to transmit U, followed 
by a Wyner-Ziv coding |f2] to transmit A with side information (U, B) at Bob. 

B. Proof of Converse 

In this section, we prove the converse part of Theorem Q] i.e., we prove the following proposition: 
Proposition 3: Let (R, D, A) be an achievable tuple. There exist two random variables U, V and a 
function A : V X B — > A, such that U -e- V -e- A -e- (B, E) form a Markov chain and 

R > I(V;A\B) , 

D > E[d(A, A(V, B))] , 



A < 



H(A\VB) + I(A; B\U) - I {A; E\U) 



Proof: Let e > 0. There exists an (n, R + e)-code (/, g) s.t: 

E[d(A n ,g(f(A n ),B n ))] < D + e, 
-H{A n \f{A n ),E n ) > A-e. 
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Denote by W — f(A n ) the transmitted message. The following Markov chain holds for each subset 
Jc {l,...,n}0 

(W,Aj,Bj,Ej)-e-Aje-e-(Bjo,Ejo) . (1) 
For each i 6 {1, . . . , n}, define Ui and V% such that 

Ui = (WiBh^E*- 1 ) , (2) 

Vi = (W, A* -1 , B* -1 , B? +1 , E* -1 ) . (3) 

From Equation ([1), Ui -e- Vi -e- Aj -e- (Bi,Ei) form a Markov chain. 
7 J /toe; 

n(i? + e) > 

= I(W;A n B n E n ) 

> I(W;A n E n \B") 



(d) 



i=l 
n 

^ I(WA i ~ 1 B i ~ 1 Bi +1 E i ~ 1 ;A i Ei\Bi) - I(A 1 ^ 1 B^ 1 B^E^ 1 ; A t Ei\Bi 

2=1 

n 



i=l 



> ^/(V^A^) , 



i=i 



where 

. step (a) follows from = f(A n ), 

• step (6) from the non-negativity of mutual information, 

• step (c) from the chain rule for conditional mutual information, 

• step (d) from the independence of the random variables A, B and E across time, 

• step (e) from the non-negativity of mutual information and Equation ([3J. 

Following the usual technique, we now define an independent random variable Q uniformly distributed 
over the set {1, . . . , »}, and A = Aq, B = Bq, E = Eq, U = (Q, Uq), and V = (Q, Vq). U -e- V - 
e- A -e- (5, 7?) still form a Markov chain and (A, B, E) is distributed according to the joint distribution 
p(a,b 7 e) i.e., the original distribution of (Ai,Bi,Ei). Then 

1 " 

R + t > -Y / i (Vq;A q \b q ,Q = i) 

i=l 

= I(Vq;Aq\BqQ) 

= I(V;A\B). (4) 

1 J c denotes the complement of J in {1, . . . , n}: J c = {1, . . . , n} \ J. 
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2) Distortion at Bob: Bob reconstructs g(W,B n ). The i-th coordinate of this estimate writes 

giWBi-^BuBbJ^MVuBi) . 
The component-wise mean distortion at Bob is: 

1 ™ 

E[d(A n ,g(f(A n ),B n ))] = -J2E[d(Ai,MVi,Bi))] 

1 ™ 

= -J2E[d(A Q ,A Q (V Q ,B Q ))\Q = i] 

i—l 

= E[d(A Q ,A Q (V Q ,B Q ))] 
= E[d(A,A(V,B))] , 

where we defined function A on V x B by 

A{v, b) = A(Q, v Q , b q ) 4 A Q (v Q , b q ) . 

Consequently, 

E[d(A, A(V, B))] <D + e . (5) 

3) Equivocation Rate at Eve: 

H(A n \W, E n ) = H{A n \W) -I(A n ;E n \W) 

= H{A n \WB n ) + I(A n ;B n \W) - I(A n ;E n \W) 

= H(A n \WB n ) + I(A n ;B n ) - I(W;B n ) - I(A n ; E n ) + I(W; E n ) 

n 

( =' ^HiAAWA^B") +I(A i ;B i ) - I{A t] E t ) - I{WB? +l] B t ) + I{WE i - 1 ;E i ) 

i=l 

n 

( = } ^2 H(A i \WA i ^ 1 B n E i ^ 1 ) + I(Ai\ Bi) - I(Ai;Ei) - I(WB? +1 ;B t ) 

i=l 

+ HWtf-^Ei) + I{Ei- B^WE*- 1 ) - ^Bi-E^WB^) 

n 

= H(A i \WA i - 1 B n E i ~ 1 ) + I(A t ; B t ) - I(A i; E t ) 

i=l 

+ I(Ei; WB^E*- 1 ) - I(Bi; WB^E*- 1 ) 

n 

( = 5 H ( A iWiBi) + I(Ar, Bi) - I(Ai; Ei) + I{E V , U t ) - J(B i; U,) 



i=l 

(e) 



H(Ai\ViBi) + I (At; Bi\Ut) - I^E^Ui) , 



where 

« step (a) follows from the Markov chain W -e- A™ -e- (£?", S n ), 

• step (b) from the chain rules for conditional entropy and mutual information, and the fact that random 
variables A i7 Bi and Ei are independent across time, 
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• step (c) from the Markov chain Ai -e- WA 1 ^ 1 -e- E 1 ^ 1 (see Equation (Q]l) and Csiszar and Korner's 
equality 0, 

• step (d) from the definitions of random variables Ui and Vi (Equations (0 and (0, resp.), 

• step (e) from the Markov chain Ui -e- A; -e- (_Bj, £^). 
Now, using auxiliary random variable Q defined above, 

1 1 - 

-H(A n \W,E n ) = ~Y,H(A Q \V Q B Q ,Q = i) + I(A Q ;B Q \U Q ,Q = i)-I(A Q ;E Q \U Q ,Q = i) 

= H(A\VB) + I(A; B\U) - I(A; E\U) . 
Moreover, H(A n \W, E n ) > 0, consequently, 



H(A\VB) + I(A; B\U) - I(A; E\U) 
This proves Proposition [3] 



> A - e . (6) 



V. Application Example: Coding Binary Source with BEC and BSC Side Informations 

Consider the source model depicted in Fig. [2] where the source is binary and the side informations at 
Bob and Eve are the outputs of a binary symmetric channel (BSC) with crossover probability p 6 [0, 1/2] 
and a binary erasure channel (BEC) with erasure probability e 6 [0, 1/2], respectively, with input A. Let 
/i2 denotes the binary entropy function given by fi2(x) — —x\og 2 (x) — (1 — x) log 2 (l — x). According 
to the values of the parameters (p, e) as summarized in Fig. [3] it is not difficult to show by means of 
standard manipulations that the broadcast channel with input A and outputs {B, E) satisfies the following 
properties: 

(a) The side information E is a stochastically degraded version of B, i.e., there exists a random variable 
E such that A -e- B -e- E form a Markov chain and P^ A = Pe\a, 

(b) The side information B is lessnoisy than E, i.e., for all random variable U such that U -e-A-e-(B, E), 
I(U;B)>I(U;E), 

(c) The side information B is more capable than E, i.e., I(A; B) > I(A; E), 

(d) Any of the above relations hold between the side informations B and E. 
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2p 4p(l-p) h 2 {p) i e 

(a) ' (6) ' (c) ' (d) 

Figure 3. The different regions as a function of e. 



Observe that this model is of interest since neither Bob nor Eve can always be a lessnoisy decoder 
for all values of (p,e). Thus in general U is neither constant nor equal to V. We also remark that 
Corollary [3] provides the rate-distortion-equivocation region when e lies in regions (a) or (b). Otherwise, 
only Theorem Q] applies for the general case. 

From now on, let the distortion function at Bob d be the Hamming distance and assume for simplicity 
that the source is uniform distributed, i.e., Pr {A = 0} = Pr {A = 1} = 1/2. We know from the cardinality 
constraints given in Proposition!]] that it suffices to consider sets U and V such that ||W|| < 4 and || V|| < 12. 
Moreover, from the following proposition, we can restrict our attention to the auxiliary variables (U, V) 
obtained as the outputs of a degraded binary symmetric broadcast channel with input A, as it is depicted in 
Fig - S] Notice that V is identical to the auxiliary variable used by Wyner and Ziv [2| for the rate-distortion 
function of a binary source in the case where there is no eavesdropper. 

Proposition 4: In the case considered in this section, and depicted in Fig. [2] region 1Z* is the set of all 
tuples (R, D, A) such that there exist a, f3 £ [0, 1/2] satisfying 

R > e(l-h 2 (a)) , 
D > sa , 



< 



eh 2 (a) + (!-£) h 2 {a* fi) - h 2 (p*a* (3) + h 2 (p) 



Proof: The achievability part of Proposition [4] is a direct application of Theorem [T] define auxiliary 
random variables U and V as depicted in Fig. |U and function A on V by A(v) — v. Expressions of 
Proposition @] follow after some straightforward derivations. 

The converse part needs more arguments. The proof is omitted here and will be provided in an extended 
version of this paper. ■ 




1-/3 



1 - a 



Figure 4. Binary auxiliary random variables. 
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p = 0.100, e = 0.469 
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0.05 0.1 0.15 0.2 0.25 0.3 

D 

Figure 5. Equivocation rate at Eve as a function of the distortion at Bob. 

We now numerically compute some achievable values for p = 0.1 and e — h 2 (p) = 0.469 (see Fig. [5]). 
In the case of lossless compression (columns #1 and #2 of Table JJ, the auxiliary random variable V 
is set to be A i.e., a — 0. The additional variable U actually enables a non-zero equivocation level, as 
noted in |[T31 . Assume that the coding rate is limited to a maximum of 80% of the required rate for 
perfect reconstruction of the source (column #3). This induces a distortion of 1.5% at Bob and then an 
equivocation rate of 0.126bits at Eve is achievable. This means that even a small increase in the distortion 
at Bob can be fully exploited by Alice to achieve very significant gains (more than third times in this case) 
in terms of equivocation rate at Eve. Moreover, in the situation considered in this paragraph, Wyner-Ziv 
coding actually achieves the optimal performance for distortion levels higher than 0.036 as shown in Fig. [5] 

VI. Summary and Discussions 

The problem of secure lossy source coding of memoryless sources in the presence of an eavesdropper 
with different correlated side informations at the legitimate decoder (Bob) and the eavesdropper (Eve) 
was investigated. A complete characterization of the rate-distortion-equivocation region was derived for 



Table I 

Some achievable tuples and corresponding auxiliary random variables . 





Lossless secure source coding 


Slepian-Wolf 


Lossy secure source coding 


Wyner-Ziv 


Rate R 


0.469 


0.469 


0.375 


0.375 


Distortion D 








0.015 


0.015 


Equivocation Rate A 


0.039 





0.133 


0.126 


a 








0.031 


0.031 


P 


0.078 





0.050 
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the case of arbitrary correlated side information at the decoders. It was shown that both the available 
distortion at the legitimate decoder and the statistical properties of the side informations can be exploited 
by the encoder (Alice) to increase the equivocation rate at the eavesdropper. 

An application example to secure lossy source coding of a binary source, where the side information at 
Bob (resp. Eve) is the output of a binary erasure channel (resp. a binary symmetric channel) with the source 
as the input, was considered. This model is of interest since neither Bob nor Eve can always be a lessnoisy 
decoder and thus the encoding strategy needed to achieve the optimal equivocation rate is rather tricky. In 
the case where the source is uniformly distributed, we proved that it is optimal to consider binary auxiliary 
random variables and provided corresponding expressions for the rate-distortion-equivocation region. 

As future and on-going work, it would be of interest to extend the results in the present work to the 
more general setting in which the encoder wishes to maximize not only the equivocation rate, but also an 
arbitrary measure of the equivocation distortion at the eavesdropper. 
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